Hybridization Number on Three Trees

نویسندگان

  • Leo van Iersel
  • Steven Kelk
  • Nela Lekic
  • Chris Whidden
  • Norbert Zeh
چکیده

Phylogenetic networks are leaf-labelled directed acyclic graphs that are used to describe nontreelike evolutionary histories and are thus a generalization of phylogenetic trees. The hybridization number of a phylogenetic network is the sum of all indegrees minus the number of nodes plus one. The Hybridization Number problem takes as input a collection of phylogenetic trees and asks to construct a phylogenetic network that contains an embedding of each of the input trees and has a smallest possible hybridization number. We present an algorithm for the Hybridization Number problem on three binary phylogenetic trees on n leaves, which runs in time O(ckpoly(n)), with k the hybridization number of an optimal network and c some positive constant. For the case of two trees, an algorithm with running time O(3.18kn) was proposed before whereas an algorithm with running time O(ckpoly(n)) for more than two trees had prior to this article remained elusive. The algorithm for two trees uses the close connection to acyclic agreement forests to achieve a linear exponent in the running time, while previous algorithms for more than two trees (explicitly or implicitly) relied on a brute force search through all possible underlying network topologies, leading to running times that are not O(ckpoly(n)), for any c. The connection to acyclic agreement forests is much weaker for more than two trees, so even given the right agreement forest, the reconstruction of the network poses major challenges. We prove novel structural results that allow us to reconstruct a network without having to guess the underlying topology. Our techniques generalize to more than three input trees with the exception of one key lemma that maps nodes in the network to tree nodes and, thus, minimizes the amount of guessing involved in constructing the network. The main open problem therefore is to prove results that establish such a mapping for more than three trees. ∗Centrum Wiskunde & Informatica (CWI), Science Park 123, 1098 XG Amsterdam, The Netherlands, [email protected] †Leo van Iersel and Nela Lekić were respectively funded by Veni and Vrije Competitie grants from The Netherlands Organisation for Scientific Research (NWO). ‡Department of Knowledge Engineering (DKE), Maastricht University, P.O. Box 616, 6200 MD Maastricht, The Netherlands, [email protected], [email protected] §Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., PO Box 19024, Seattle, WA, USA 98109, [email protected] ¶Faculty of Computer Science, Dalhousie University, 6050 University Ave, Halifax, NS B3H 1W5, Canada, [email protected] 1 ar X iv :1 40 2. 21 36 v1 [ cs .D S] 1 0 Fe b 20 14

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Hybridization Number on Three Rooted Binary Trees is EPT

Phylogenetic networks are leaf-labeled directed acyclic graphs that are used to describe nontreelike evolutionary histories and are thus a generalization of phylogenetic trees. The hybridization number of a phylogenetic network is the sum of all in-degrees minus the number of nodes plus one. The hybridization number problem takes as input a collection of rooted binary phylogenetic trees and ask...

متن کامل

Fast computation of minimum hybridization networks

MOTIVATION Hybridization events in evolution may lead to incongruent gene trees. One approach to determining possible interspecific hybridization events is to compute a hybridization network that attempts to reconcile incongruent gene trees using a minimum number of hybridization events. RESULTS We describe how to compute a representative set of minimum hybridization networks for two given bi...

متن کامل

Kernelizations for the Hybridization Number Problem on Multiple Nonbinary Trees

Given a finite setX, a collection T of rooted phylogenetic trees onX and an integer k, the Hybridization Number problem asks if there exists a phylogenetic network on X that displays all trees from T and has reticulation number at most k. We show two kernelization algorithms for Hybridization Number, with kernel sizes 4k(5k) and 20k(∆ − 1) respectively, with t the number of input trees and ∆ th...

متن کامل

Fast Computation of the Exact Hybridization Number of Two Phylogenetic Trees

Hybridization is a reticulate evolutionary process. An established problem on hybridization is computing the minimum number of hybridization events, called the hybridization number, needed in the evolutionary history of two phylogenetic trees. This problem is known to be NP-hard. In this paper, we present a new practical method to compute the exact hybridization number. Our approach is based on...

متن کامل

Cherry picking: a characterization of the temporal hybridization number for a set of phylogenies.

Recently, we have shown that calculating the minimum-temporal-hybridization number for a set [Formula: see text] of rooted binary phylogenetic trees is NP-hard and have characterized this minimum number when [Formula: see text] consists of exactly two trees. In this paper, we give the first characterization of the problem for [Formula: see text] being arbitrarily large. The characterization is ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1402.2136  شماره 

صفحات  -

تاریخ انتشار 2014